Ming Cheng

DiffCrossGait: Trajectory-Level Alignment for 2D-3D Cross-Modal Gait Recognition via Latent Diffusion

May 29, 2026
Via arXiv

Text-guided Feature Disentanglement for Cross-modal Gait Recognition

May 29, 2026
Via arXiv

DM-ASR: Diarization-aware Multi-speaker ASR with Large Language Models

Apr 24, 2026
Via arXiv

Walking Further: Semantic-aware Multimodal Gait Recognition Under Long-Range Conditions

Mar 15, 2026
Via arXiv

Design and Research of a Self-Propelled Pipeline Robot Based on Force Analysis and Dynamic Simulation

Dec 19, 2025
Via arXiv

Spatially-Augmented Sequence-to-Sequence Neural Diarization for Meetings

Oct 10, 2025
Via arXiv

Diarization-Aware Multi-Speaker Automatic Speech Recognition via Large Language Models

Jun 06, 2025
Via arXiv

Music's Multimodal Complexity in AVQA: Why We Need More than General Multimodal LLMs

May 27, 2025
Via arXiv

Sci-LoRA: Mixture of Scientific LoRAs for Cross-Domain Lay Paraphrasing

May 24, 2025
Via arXiv

Multi-Channel Sequence-to-Sequence Neural Diarization: Experimental Results for The MISP 2025 Challenge

May 22, 2025
Via arXiv